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Abstract 

We study kernelization (a kind of efficient preprocessing) for NP-hard problems on planar 
graphs. Our main result is a kernel of size at most 9k vertices for the Planar Maximum 
Nonseparating Independent Set problem. A direct consequence of this result is that 
Planar Connected Vertex Cover has no kernel with at most (9/8 - e)k vertices, for 
any e > 0, assuming P ^ NP. We also show a very simple 5fc-vertices kernel for Planar 
Max Leaf, which results in a lower bound of (5/4 — e)k vertices for the kernel of Planar 
Connected Dominating Set (also under P ^ NP). 

As a by-product we show a few extremal graph theory results which might be of inde- 
pendent interest. We prove that graphs that contain no separator consisting of only degree 
two vertices contain (a) a spanning tree with at least n/4 leaves and (b) a nonseparating 
independent set of size at least n/9 (also, equivalently, a connected vertex cover of size at 
most |n). The result (a) is a generalization of a theorem of Kleitman and West S who 
showed the same bound for graphs of minimum degree three. Finally we show that every 
n- vertex outerplanar graph contains an independent set / and a collection of vertex-disjoint 
cycles e such that 9|/| > An - 3|e|. 

1 Introduction 

Many NP-complete problems, while most likely not solvable efficiently, admit kernelization 
algorithms, i.e. efficient algorithms which replace input instances with an equivalent, but often 
much smaller one. More precisely, a kernelization algorithm takes an instance / of size n and 
a parameter /c € N, and after time polynomial in n it outputs an instance I' (called a kernel) 
with a parameter k' such that / is a yes-instance iff /' is a yes instance, k' < k, and |/'| < f{k) 
for some function / depending only on k. The most desired case is when the function / is 
polynomial, or even linear (then we say that the problem admits a polynomial or linear kernel). 
In such a case, when the parameter k is relatively small, the input instance, possibly very large, 
is "reduced" to a small one. In this paper by the size of the instance |/| we always mean the 
number of vertices. 

In the area of kernelization of graph problems the class of planar graphs (and more generally 
iJ-minor-free graphs) is given special attention. This is not only because planar graphs are 
models of many real-life networks but also because many problems do not admit a (polynomial) 
kernel for general graphs, while restricted to planar graphs they have a polynomial (usually even 
linear) kernel. A classic example is the 335fc-vertex kernel for the Planar Dominating Set 
due to Alber et al. [1]. In search for optimal results, and motivated by practical applications, 
recently researchers try to optimize the constants in the linear function bounding the kernel 
size, e.g. the current best bound for the size of the kernel for the Planar Dominating Set is 
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67k [2] . Such improvements often require nontrivial auxiliary combinatorial results which might 
be of independent interest. Our paper fits into this framework. 

Let G = {V, E) be a graph and let 5 be a subset of V . By G[S] we denote the subgraph of 
graph G induced by a set of vertices S. We say that S is nonseparating if G[V — S] is connected, 
otherwise we say that S is a separator. We focus on kernelization of the following problem: 

Maximum Nonseparating Independent Set (NSIS) Parameter: k 

Input: a graph G = {V,E) and an integer A; € N 

Question: Is there a nonseparating independent set of size at least k? 

In what follows, \V\ is denoted by n. This problem is closely related with Connected 
Vertex Cover (CVC in short), where given a graph G = {V,E) and an integer k we ask 
whether there is a set 5 C y of size at most k such that 5" is a vertex cover (i.e. every edge of G 
has an endpoint in S) and S induces a connected subgraph of G. The CVC problem has been 
intensively studied, in particular there is a series of results on kernels for planar graphs [6l [13] 
culminating in the recent ^k kernel [9]. It is easy to see that C is a connected vertex cover iff 
V — G is a nonseparating independent set. In other words, {G, k) is a yes-instance of CVC iff 
(G, n — k) is a yes-instance of NSIS. In such a case we say that NSIS is a parametric dual of 
CVC. An important property of a parametric dual, discovered by Chen et al [2], is that if the 
dual problem admits a kernel of size at most ak, then the original problem has no kernel of size 
at most (a/(a — 1) — e)A;, for any e > 0, unless P=NP. 

As we will see, the NSIS problem in planar graphs is strongly related to the Max Leaf 
problem: given a graph G and an integer fc, find a spanning tree with at least k leaves. 

Our Kernelization Results We study Planar Maximum Nonseparating Independent 
Set (Planar NSIS in short), which is the NSIS problem restricted to planar graphs. We show 
a kernel of size at most 9/c for Planar NSIS. This implies that Planar Connected Vertex 
Cover has no kernel of size at most (9/8 — e)fc, for any e > 0, unless P=NP. This is the first 
non-trivial lower bound for the kernel size of the PLANAR CVC problem. Our kernelization 
algorithm is very efficient: it can be implemented to run in 0{n) time. As a by-product of our 
considerations we also show a hk kernel for both Max Leaf and Planar Max Leaf, which 
in turn implies a lower bound of (5/4 — e)k for its parametric dual, i.e. Planar Connected 
Dominating Set. 

Our Combinatorial Results Some of our auxiliary combinatorial results might be of in- 
dependent interest. We prove that graphs that contain no separator consisting of only degree 
two vertices contain (a) a spanning tree with at least n/4 leaves and (b) a nonseparating in- 
dependent set of size at least n/9 (also, equivalently, a connected vertex cover of size at most 
|n). The result (a) is a generalization of a theorem of Kleitman and West [8] who showed the 
same bound for connected graphs of minimum degree three. As an another variation of this 
theorem, we show that every connected graph in which every edge has an endpoint of degree 
at least 3 has a spanning tree with at least n/5 leaves. Finally we show that every n- vertex 
outerplanar graph contains an independent set / and a collection of vertex-disjoint cycles C such 
that 9|/| > 4n - 3|e|. 

Previous Results There are a few interesting results on NSIS in cubic/subcubic graphs. 
Speckenmeyer [iTj and Huang and Liu [7] focus on cubic graphs and show some interesting 
relations between the size of maximum nonseparating independent set and the values of maxi- 
mum genus and the size of the minimum feedback vertex set. Ueno et al. |.12J showed that the 
problem is polynomial-time solvable for subcubic graphs by a reduction to the matroid parity 
problem. 

As the CVC is NP-complete even in planar graphs so is NSIS. To the best of our knowl- 
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edge there is no prior work on the parameterized complexity of Maximum Nonseparating 
Independent Set. The reason for that is simple: a trivial reduction from Independent Set 
(add a vertex connected to all the vertices of the original graph) shows that NSIS is W[l]-hard, 
i.e. existence of an algorithm of complexity 0{f{k) ■ \ V\'-^^^^) is very unlikely (and so is the exis- 
tence of a polynomial kernel). However, by general results on kernelization for sparse graphs [1], 
one can see that NSIS admits a 0{k) kernel for apex-minor-free graphs, so in particular for pla- 
nar graphs. However, the general approach does not provide a good bound on the constant 
hidden in the asymptotic notation. Observe that this constant is crucial: since we deal with an 
NP-complete problem, in order to find an exact solution in the reduced instance, most likely 
we need exponential time (or at least superpolynomial, because for planar graphs 2'^^^)-time 
algorithms are often possible), and the constant appears in the exponent. 

Max Leaf has been intensively studied. Although there is a 3.75A; kernel even for general 
graphs due to Estivill-Castro et al. [3j, some of their reductions do not preserve planarity. 
Moreover, the algorithm and its analysis are extremely complicated, while our method is rather 
straightforward. 

Yet another equivalent formulation of NSIS Consider the NSIS problem again. It is easy 
to see that if graph G has two nontrivial (i.e. with at least two vertices) connected components, 
then the answer is NO. Furthermore, an instance {G,k) consisting of a connected component 
C and an independent set / is equivalent to the instance {G[C],k — \I\). Hence, w.l.o.g. we 
may assume that the input graph G is connected. It is easy to see that the Maximum Non- 
separating Independent Set problem for connected graphs is equivalent to the following 
problem, which we name Maximum Independent Leaf Spanning Tree: 

Maximum Independent Leaf Spanning Tree Parameter: k 

Input: a graph G = {V,E) and an integer A; € N 

Question: Is there a spanning tree T such that the set of leaves of T contains a subset of 
size k that is independent in G? 

In what follows, we will use the above formulation, since it directly corresponds to our 
approach. 

Nonstandard terminology and notation By Ng{v) we denote the set of neighbors of vertex 
V. By a d- vertex we mean a vertex of degree d. 

2 A simple 12k kernel for Planar NSIS 

In this section we describe a relatively simple algorithm that finds a 12k kernel for Planar 
NSIS. This is achieved by the following three steps. First, in Section [2.11 we show a reduction 
rule and a linear-time algorithm which, given an instance (G, k), returns an equivalent instance 
{G',k') such that |V(G')| < |y(G)|, k' < k, and moreover G' has no separator consisting of 
only 2-vertices. Second, in Section 12.21 we show that G' has a spanning tree T with at least 
|y(G')|/4 leaves (and it can be found in linear time). Denote the set of leaves of T by L. 
Third, in Section [2.31 we show that the graph G'[L\ is outerplanar. It follows that G'[L\ has an 
independent set of size at least \L\/2> (which can be easily found in linear time) and, consequently, 
T has at least |y(G')|/12 leaves that form an independent set. Hence, if k' < \V{G')\/12 our 
algorithm returns the answer YES (and the relevant feasible solution if needed). Otherwise 
\V{G')\ < 12k' < 12k so {G',k') is indeed the desired kernel. 
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2.1 The separator rule 

Now we describe our main reduction rule, which we call separator rule. It is easier for us to 
prove the correctness of the rule for the CVC problem and then convert it to a rule for the NSIS 
problem. 

Separator rule Assume there is a separator S consisting of only 2- vertices. As 

long as S contains two adjacent vertices, remove one of them from S (note that S 
is still a separator). Next, choose any v & S such that the two neighbors a,b of 
V belong to distinct connected components of G[V — S]. If deg(a) = deg(6) = 1, 
remove a from G. If deg(a) = 1 and deg(6) > 2, remove a from G and decrease the 
parameter /c by 1. Proceed analogously when deg(6) = 1 and deg(a) > 2. Finally, 
when deg(a), deg(6) > 2, contract the path avb into a single vertex v' and decrease 
k by 2. 

Wc say that a reduction rule for a parameterized problem P is correct when for every instance 
(G, k) of P it returns an instance {G' , k') such that: 

a) {G', k') is an instance of P, 

b) (G, k) is a yes-instance of P iff (G', k') is a yes-instance of P, 

c) k' < k. 

Lemma 1. The separator rule is correct for Planar CVC. 

Proof. Since the separator rule modifies the graph by removing a vertex or contracting a path 
it is planarity preserving, so a) holds. The condition c) is easy to check so we focus on b), i.e. 
the equivalence of the instances. 

The case when dcg(a) = deg(6) = 1 are trivial so we skip the argument. 

Now assume deg(a) = 1, deg(6) > 2 (the case deg(6) = 1, deg(a) > 2 is symmetric). If C 
is a minimum connected vertex cover of G, \C\ < k, then v £ C and a ^ C. Since G[C] is 
connected, deg{v) = 2 and deg(6) > 2, also b E C. It follows that C \ {v} is a connected vertex 
cover of G' of size at most k' = k — 1. In the other direction, if C' is a connected vertex cover 
of G' , \C'\ < k' = k — 1, then b £ C' and clearly C' U {v} is a connected vertex cover of G. 

Finally, assume deg(a), deg(5) > 2. Let A and B be the connected components of G\y — S] 
that contain a and 6, respectively. Let oq (resp. 6o) be any neighbor of a (resp. b) distinct from 
V (oo and 6o exist since deg(a), deg(6) > 2). Note that ao £ A\J S , b^ £ B \J S and oq, 6o £ G' ■ 

Let us first assume that G' has a connected vertex cover C", \C'\ < k' . We show that G' 
has a connected vertex cover D', \D'\ < k' such that v' G D' . Then it is easy to check that 
C = {D' \ {v'}) U {a, V, b} is the required connected vertex cover of G, and |G| = \D'\ + 2 <k. 

If v' G G' we just put D' = C' so suppose that v' G' . Then ao,6o G C . Since G'[C'] is 
connected, there is a path P from ag to 6o in G"[G'], possibly of length 0. Since oq £ A\J S and 
6o G -B U 5 we infer that P contains a vertex w; G 5 n G' . It follows that D' = G' \ {w} U {v'} 
is a connected vertex cover of G' of size at most k' . 

Let us now assume that {G,k) has a connected vertex cover G, \G\ < k. If {a,v,b} C G, 
then clearly G' = {G\{a, v, b})U{v'} is a connected vertex cover of G' with \G'\ < k — 2 = k' . On 
the other hand, if {a, v, b} ^ G, then G[G U {a, v, b}] contains a cycle (because G is connected). 
Since 5 is a separator, this cycle has to contain some wi E S other than v. In this case we claim 
that G' = {G \ {wi}) U {a,v,b} is a connected vertex cover of G' and |G'| < A;'. It is clear that 
G' is a connected vertex cover. The size bound follows from the fact, that G has to contain two 
out of the three vertices {a, v, b}. □ 
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Now, we convert the separator rule to the dual separator rule as follows. Let {G,i) be 
an instance of (Planar) NSIS. Put k = \ V{G) \ — £, apply the separator rule to (G, k) and get 
{G',k'). Put f = \V{G')\ - k' and return (G',f ). 

Corollary 2. The dual separator rule is correct for PLANAR NSIS. 

Proof. The condition c) is easy to check while Lemma [T] implies a) and b). □ 

It is clear that the dual separator rule can be implemented in linear time. However, we 
would like to stress a stronger claim: there is a linear-time algorithm that given a graph G 
applies the separator rule as long as it is applicable. This algorithm can be sketched as follows. 
First we remove all 1-vertices that are adjacent to 2-vertices (note that this is a special case of 
the dual separator rule). Second, we find all maximal paths that contain 2-vertices only. For 
every such path, if it contains at least 3 vertices (and hence two of them form a separator), 
we replace it by a path of two vertices (this is again a special case of the dual separator rule) . 
It is easy to implement these two steps in linear time. Now, every 2-vertex has at most one 
neighboring 2-vertex. We remove all 2-vertices that do not have neighbors of degree 2 and for 
each pair of adjacent 2-vertices we remove exactly one of them. Next, we pick a connected 
component A of the resulting graph and we mark all its vertices. Then we consider all degree 
2 neighbors of this component (that has been removed). If such a neighbor v has an unmarked 
neighbor then it connects A with another component B. We apply the separator rule to the 
vertex v (in constant time) and we mark v as processed. Then we mark all the vertices of B. As 
a result the components A and B are joined into a new component A. In any case, the vertex 
V is not considered any more. We continue the procedure until the graph gets connected. All 
the removed 2-vertices that are not marked as processed are put back in the graph. Since every 
vertex of G is marked at most once, the whole algorithm works in linear time. 

2.2 Finding a spanning tree with many leaves 

Kleitman and West [8j showed how to find a spanning tree with at least n/4 leaves in a graph of 
minimum degree 3. In this section we generalize their result by proving the following theorem. 

Theorem 3. Let G he a connected n-vertex graph that does not contain a separator consisting 
of only 2-vertices. Then G has a spanning tree with at least n/4 leaves. Moreover, such a tree 
can be found in linear time. 

We will slightly modify the approach of Kleitman and West so that vertices of smaller degree 
are allowed. 

First note that it suffices to show a simplified case where G has no edge uv such that both 
u and V are 2-vertices (and we still assume the nonexistence of a separator consisting of only 
2-vertices). Indeed, if the theorem holds for the simplified case, we just remove from G all 
the edges uv such that both u and v are 2-vertices, call the new graph G. Note that G is 
connected and G does not contain a separator consisting of only 2-vertices (otherwise the old G 
contains such a separator). Hence we get a spanning tree T of G with at least |y(G)|/4 leaves 
by applying the simplified case. However, since 11^(6*) | = |l^(G)j and T is also a spanning tree 
of G, so T is also the required tree for the general claim. Hence in what follows we assume that 
G has no edge with both endpoints of degree 2. 

In order to build a spanning tree T our algorithm begins with a tree consisting of an ar- 
bitrarily chosen vertex (called a root), and then the spanning tree is built by a sequence of 
expansions. To expand a leaf v £ T means to add the vertices of Ng{v)\V{T) to T and connect 
them to v in T. Note that in a tree T built from a root by a sequence of expansions, if a vertex 
in V{G) — V{T) is adjacent with v G V{T), then v is a, leaf. 
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The order in which the leaves are expanded is important. To describe this order, we introduce 
three operations (operations 01 and 03 are the same as in [8j, but 02 is modified): 

(01) Applies when there is a leaf v G V(T) such that |A'^g'(^^) \ ^(^)l ^ 2. Then v is expanded. 

(02) Applies when there is a leaf v e V{T) such that \Ng{v) \ V{T)\ = 1 (let Ng{v) \ V{T) = 
{x}), and moreover \Ng{x) \ V{T)\ = or \Ng{x) fl V{T)\ > 2. Then v is expanded. 

(03) Applies when there is a leaf v G V{T) such that {Nciv) \ V{T)\ = 1 (let Ndv) \ V{T) = 
{x}), and moreover \Ng{x)\V{T)\ > 2. Then v is expanded and afterwards x is expanded. 

Now we can describe the algorithm for Theorem [3l which we call GENERIC: 

1. choose an arbitrary vertex r € 1/ and let T = {r}, 

2. apply 01-03 as long as possible, giving precedence to 01. 

We claim that GENERIC returns a spanning tree of G. Assume for a contradiction that at some 
point the algorithm is able to apply none of 01-03 but V{G) ^ ViT). Consider any leaf v T 
such that Ng{v) % ^(T")- Such a leaf exists because ViG) ^ V{T) and T is built by a sequence 
of expansions . Since 01 does not apply, \Ng{v) \ V{T)\ = 1. Let Ng{v) \ V{T) = {x}. Since 
02 does not apply, NG{x)r]V{T) = {v}. Since neither 02 nor 03 apply, \Ng{x)\V {T)\ = I. It 
follows that deg(^(x) = 2. Moreover, since there are no edges between 2-vertices, the neighbor 
of x outside of T is not a 2- vertex. It follows that Ui)eL(r) ^g{v)\V (T) is a separator consisting 
of 2-vertices, which is the desired contradiction. 

It remains to show that if a spanning tree T was constructed, then it has at least n/4 leaves. 
It can be done exactly as in the work of Kleitman and West [5]. However, we do it in a different 
way, in order to introduce and get used to some notation that will be used in later sections, 
where we describe an improved kernel. 

We say that a leaf u of T is dead if Ng{u) \ V{T) = 0. Note that after performing 02 there is 
at least one new dead leaf: if \Ng{x)\V{T)\ = then x is a dead leaf, and if \NG{x)riV{T)\ > 2 
then all of {Ng(x) fl V{T)) \ {v} are dead leaves, because of 01 precedence. For any tree T, by 
L{T) we denote the set of leaves of T. 

Let Xi be the set of the inner vertices of T that were expanded by an operation of type Oi. 
Let X be the set of the inner vertices of T; note that X = Xi U X2 U X3. Since T is rooted, the 
standard notions of parent and children apply. For a positive integer i, let Pi denote the set of 
vertices of T with exactly i children. 

Since every vertex besides r is a child of some vertex, we have ^(i>x d\P(i\ = n — 1. Since 
the set of vertices with one child is equal to X2 U (X3 n Pi) and {X^ n Pi [ = {X^D P>2 1 it follows 
that 

\X2\ + \XsnP>2\ + Y,d\Pd\=n-l. (1) 

d>2 

Since during 02 at least one leaf dies, \X2\ < \L{T)\. Similarly, since after expanding a vertex 
from X3 nP>2 the cardinality of L{T) increases, {X^ nP>2[ < 1-^^(7")! — 1- Finally, J2d>2 '^l^dl — 
J2d>2'^id-'^)\Pd\ = 2(|i(r)|-l). After plugging these three bounds to 1^ we get \L{f)\ > n/4, 
as required. This finishes the proof of Theorem [3l 

2.3 Outerplanarity 

Lemma 4. If G is a planar graph and T is a spanning tree of G, then the graph G[L{T)] is 
outerplanar. 
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Proof. Fix a plane embedding of G and consider the induced plane subgraph G' = G[L{T)]. 
Since T is connected, all vertices of L{T) lie on the same face of G' . Therefore G' is outerplanar. 

□ 

Corollary 5. If G is a planar graph and T is a spanning tree of G then there is a subset of 
leaves of T of size at least \L(T)\/3 that is independent in G (and it can be found in linear 
time). 

Proof. It is well-known that outerplanar graphs are 3-colorable and the 3-coloring can be found 
in linear time. So, by Lemma[l]we can 3-color G[L(T)] and we choose the largest color class. □ 

3 A 9k kernel 

In this section we present an improved kernel for the Maximum Independent Leaf Spanning 
Tree problem. Although the analysis is considerably more involved than that of the 12k kernel, 
the algorithm is almost the same. We need only to force a certain order of the operations 01-03 
in step 2. As before, the algorithm always performs the 01 operation if possible (we will refer 
to this as the 01 rule). Moreover, if more than one 01 operation applies then we choose the 
one which maximizes the number of vertices added to T (we will refer to this as the largest 
branching rule). If there is still more than one such operation applicable then among them 
we choose the one which expands a vertex that was added to T later than the vertices which 
would be expanded by other operation (we will refer to this as the DPS rule). Similarly, if there 
are no 01 operations applicable but more than one 02/03 operations apply, we also use the 
DFS rule. The algorithm GENERIC with the order of operations described above will be called 
BRANCHING. 

Note that the algorithm BRANCHING is just a special case of GENERIC, so all the claims 
we proved in Section [2] apply. Let us think where the bottleneck in this analysis is. There are 
two sources of trouble. First, if there are many 02/03 operations we get a spanning tree with 
few leaves: in particular there might be only 02 operations and 03 operations that add just 
two leaves (consider a cubic graph obtained from the cycle vqVi . . . v^k-i^o by adding k vertices 
it^o, • • • , Wk-i and for each i = 0, . . . , /c — 1 add the following edges: WjUsj, WjUsj+i, i(;if3j+2) and 
we get roughly n/4 leaves. Second, if the outerplanar graph G[L{T)] is far from being bipartite 
(i.e. has many short odd cycles) then we get a small independent set: in particular, when 
G[L{T)] is a collection of disjoint triangles, the maximum independent set in G[L{T)] is of size 
exactly |L(r)|/3. However, we will show that these two extremes cannot happen simultaneously. 
More precisely, we prove the following two theorems. 

Theorem 6. Let G be a connected n-vertex graph that does not contain a separator consisting 
of only 2-vertices. Then G has a spanning tree T such that if Q is a collection of vertex- disjoint 
cycles in G[L{T)], then 

n + 3|e| 

\L{T)\ > 

Moreover, T can be found in linear time. 

Theorem 7. Every n-vertex outerplanar graph contains 

• an independent set I, and 

• a collection of vertex- disjoint cycles C 
such that 91/1 > 4n - 3161. 
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Note that the bound in Theorem[7|is tight, which is easy to see by considering an outerplanar 
graph consisting of disjoint triangles. From the above two theorems and Lemma H] we easily get 
the following corollary. 

Corollary 8. Let G be a connected n-vertex graph that does not contain a separator consisting 
of only 2-vertices. Then G has a spanning tree T such that L{T) has a subset of size at least 
n/9 which is independent in G. Equivalently, G has a nonseparating independent set of size at 
least n/9 and a connected vertex cover of size at most |n. 

By a similar reasoning as in the beginning of Section [2] we get a 9fc-kernel for the Maximum 
Independent Leaf Spanning Tree problem. In what follows, we prove Theorem [6] and 
Theorem [71 

3.1 Proof of Theorem [6] 

Note that similarly as in Theorem [3] it suffices to show a simplified case when G has no edge 
uv such that both u and v are 2-vertices. Indeed, if the theorem holds for the simplified case, 
as before we create a new graph G by removing from G all the edges with both endpoints of 
degree 2 and as before G does not contain a separator consisting of only 2-vertices. Then we 
apply the simplified case and we get a spanning tree T such that for any collection C of vertex- 
disjoint cycles in G[L{T)], we have \L{T)\ > {\V{G)\ + 3|e|)/4 and T is a spanning tree of G 
as well. Moreover, no edge of E{G) \ E[G) belongs to a cycle in G[L{T)\ for otherwise both of 
its endpoints have degree at least 3 in G. Hence if S is a collection of vertex-disjoint cycles in 
G[L(T)] then it is also a collection of vertex-disjoint cycles in G'[L(T)], so the desired inequality 
holds. Hence in what follows we assume that G has no edge with both endpoints of degree 2. 
Since we proved that in this case the algorithm GENERIC returns a spanning tree, and each 
execution of BRANCHING is just a special case of an execution of GENERIC we infer that 
BRANCHING returns a spanning tree of G, which will be denoted by T. 

Let C be an arbitrary collection of vertex-disjoint cycles in G[L[T)]. Our general plan for 
proving the claim of Theorem [6] is to show that if |C| is large then we have few 02/03 operations 
— by ([T|) this will improve our bound on \L{T)\. To be more precise, let us introduce several 
definitions. 

Recall the 02 operation: it adds a single vertex x to T and at least one leaf of T dies. We 
choose exactly one of these dead leaves and we assign it to x. However, if the vertex x dies we 
always assign x to itself (so if some other leaves die during this operation, they are unassigned). 
Let Lu be the set of unassigned leaves of T. Clearly, IX2I = 1^(7") | — \Lu\- In order to show 
that there are few 02 operations, we will show that \Lu\ is big. 

Let xi,X2, . . . x\x\ be the inner vertices of T in the order of expanding them (in particular 
xi = r). A run is a maximal subsequence Xb, x^+i, . . . , Xe of vertices from P>2, i.e. the nodes in 
T that have at least two children. 

Lemma 9. Vertices of any run R = xi,, . . . ,Xe form a subtree of T rooted at x^. 

Proof. Assume that a vertex x € {x^+i, . . . , Xg} has the parent Xp outside the run. Then 
p < b and x was a leaf in T while Xb was being expanded. Hence, by the definition of a run, 
Xb-i G Pi, and in particular Xf,_i was expanded by 02 or 03. However, when this operation 
was performed, it was possible to expand x by 01, a contradiction with the 01 rule. Hence 
every vertex x € {x^+i, ■ ■ ■ ,Xe} has the parent in the run, which is equivalent to the claim of 
the lemma. □ 
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In what follows, the subtree from Lemma is denoted by Tr. Moreover, let ch(rR) denote 
the set of children of the leaves of Tr, i.e. 

ch(TR) = {v G V{T) \ V{Tfi) : Tr contains the parent of v}. 

We say that a run R opens a cycle C in C if the first vertex of C that was added to T belongs 
to ch(TR). The following lemma shows a relation between cycles in C and runs. 

Lemma 10. Every cycle in C is opened by some run. 

Proof. Consider any cycle C G C and let v be the first vertex of C that is added to T. Note that 
V is not added by 02, for otherwise just after adding v to T, v has at least two neighbors and 
by the 01 rule v would be the next vertex expanded and hence not a leaf of T, a contradiction. 
It follows that V is added by 01 or 03 and consequently v G ch(TR) for some run R. □ 

Now we can sketch our idea for bounding the number of 03 operations (^^03). Both after 
01 and 03 the cardinality of L{T) increases. Hence, if we fix the number of leaves in the final 
tree, then if is large then ^^03 should be small. Since a run contains at most one vertex 
of 1 (e.g. by Lemma [9]), it means that a tree Tr with a large number of children contains 
plenty of vertices from |^i|. We will show that if a run opens many cycles, then indeed |ch(rR)| 
is large. Let C/j denote the set of cycles in S opened by R. 

Lemma 11. Let R be a run. For any cycle C € Qr one of the following conditions holds: 

(i) \ch{Tn) n V{C)\ + \Lu n V{C)\ > 4, or 

(ii) |ch(r/j) n V{C)\ + \Lu n V{C)\ = 3 and \R n P>3| > 1. 

Proof. Let vi be the vertex of C that is added first to the tree T. By the definition of C/j, 
vi £ chiTji). We see that at least one neighbor of vi, call it V2, is in ch(rR), for otherwise just 
after expanding the last vertex of R the vertex vi can be expanded by 01, while the algorithm 
chooses 02/03, a contradiction with the 01 rule. 

Let w be the neighbor of V2 on C that is distinct from vi. Assume w ^ ch(TR). Then just 
after expanding the last vertex of R we have N(v2) \ T = {w}, since if \N(v2) \ T| > 2 then it 
is possible to expand V2 by 01. Hence if V2 is assigned then it is assigned to w. However, then 
w is added to T by 02 so w dies during this operation (otherwise w is expanded because of the 
DFS rule so w ^ L{T)), and hence w is assigned to w and V2 € Lu- To conclude, w G ch(TR) or 
V2 G Lu- 

If we denote by u the neighbor of vi on C that is distinct from V2, by the same argument 
we get u G ch(TR) or vi G L^. 

It follows that (i) holds, unless u = w (i.e. C is a triangle), vi,V2 and w G ch(rR). Let 
us investigate this last case. We see that |ch(rR) n F(C)| = 3. We will show that |i?nP>3| > 1. 
Since V2,w G ch(rR), they could not be added by 02 and hence vi is assigned to a vertex 
X V{C). Note that x is added to T after V2 and w. Assume w.l.o.g. that V2 was added to T 
before w. We consider two cases. If V2 was not added to T by expanding the parent of vi, then 
the parent p of V2 has at least three children (otherwise instead of expanding p the algorithm 
can expands vi and add at least three children, a contradiction with the largest branching rule), 
so \R n P>3| > 1 as required. Finally, if V2 was added to T by expanding the parent p of vi, 
then p G P>3 for otherwise just after expanding p 01 is applicable to vi so either vi or V2 is 
expanded by the DFS rule. This concludes the proof. □ 

By applying Lemma [11] to all cycles of a single run R we get the following corollary. 
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Corollary 12. For any run R that opens at least one cycle, 

\cHTr) n ^(6^)1 + \Lu n vieR)\ + \RnP>3\> 3\eR\ + 1. 

Lemma 13. For any run R, 

\Lur)V{QR)\ + Y,{2d-3)\Rr\Pd\-\Rr\X^i\>'i\QR\. (2) 

d>2 

Proof. CASE 1: ch(TR) C V{Qr). We claim that then V{Qr) C ch(rR) and so V{Qr) = 
ch(rR). If there is a cycle C G Qr with a vertex outside ch(TR) then C has a vertex in ch(TR) 
with a neighbor in ^(C) \ ch(TR). Among all such vertices for all cycles of Qr choose the one 
that was added to T last, call it v. We see that after expanding all vertices of R the algorithm 
expands v (by the DFS rule and because ch(rR) C l/(Ci?)), a contradiction because v G L{T). 
Hence indeed ^(6/?) C ch(rH). 

Now assume that there is f G V{Qr) such that v L„. Among all such vertices choose the 
one that was added last to T, call it u. Note that u is assigned to a vertex that was added to 
T after the run R. We get a contradiction again, since after expanding all vertices of R the 
algorithm expands u. To conclude, V{Qr) C L„, hence \Lu fl V{Qr)\ = Y^c&&r I^(C)I > 3|e/j|. 
Since \R fl P>2| > |i? n A3I, the claim follows. 

CASE 2: ch(r^) 2 y(eij). Then, \ch{TR)r\V {Qr)\ < |ch(r^)|-l. Since \ch{TR)\ = Ed>2d\Rn 
Pd\ - {\R\ - 1), we get 

Y,\RnPd\{d-i)>\cHTR)nv{eR)\. (3) 

d>2 

Now observe that the LHS of Q is always nonnegative, so the claim holds when Qr = 0. Hence 
we can assume that S/j 7^ and the bound of Corollary 1121 applies . Note also that |iin A3I < 1, 
which follows e.g. from Lemma [9l Then, 

\Lu n vieR)\ + ^ |i? n Pd|(2(i - 3) - |i? n A3I > 

d>2 

\Lu n ^(6^)1 + Y.\Rn Pd\{d - 1) + |i? n P>3| - 1 >® 

d>2 

\Lu n V{Qr)\ + |ch(TR) n V{Qr)\ + |i? n P>3| - l >(C°>-oIIary[l3 

□ 

Now we are ready to prove the claim of Theorem [6l i.e. that \L{T)\ > (n + 3|C|)/4. 
Let us add Yl,d>2i'^d — 3)|Prf| to both sides of ([1]): 

|A2| + |A3nP>2|+3j^(d-l)|Pd| =n-l + J^(2a!-3)|Pd|. (4) 

d>2 d>2 

Since IA2I = \L{T)\ — \Lu\ and X]d>2(^ ~ = 1-^(^)1 — 1 we get 

4|L(r)| = n + 2 + \Lu\ + Y.i'^d - 3)\Pd\ - IA3 n P>2\. (5) 

d>2 

By Lemma [10] after summing ([2]) over all runs we get 

\Lu\ + ^i2d - 3)\Pd\ - IA3 n P>2\ > 3\e\. (6) 

d>2 

The claim follows immediately after plugging ([6]) to ([5]). 
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3.2 Proof of Theorem [7] 

Theorem [7] (restated). Every n-vertex outerplanar graph contains 

• an independent set I, and 

• a collection of vertex- disjoint cycles C 
such that 9\I\ > 4n - 3|e|. 

Proof. Let H = {V, E) be the outerplanar graph under consideration. We use the induction on 
If |y| = we put / = C = and the claim follows. Hence we can assume that H has at 
least one vertex. 

CASE 1: There is a vertex v such that deg^j v <\. 

Let H' be the graph obtained from H by removing v and its neighbor, if any. We apply 
induction to H' and we get relevant Iq, Cq such that 

9|/o| > 4n-8-3|eo|. (7) 
We set / = Jo U {v} and C = Cq. Clearly, / is an independent set and we have 

9|/| = 9([/o| + 1) > 4n + 1 - 3iei. 

This finishes Case 1. 

Hence, in what follows we assume that every vertex of H has degree at least 2. Now, consider 
a leaf block Q of H, i.e. a block that contains at most one cutvertex of H. Note that Q is not 
a single edge, since then Case 1 applies. Fix any outerplanar embedding of Q and let / be the 
outer face of Q in this embedding. Let D{Q) be the graph dual to Q and let Tq be the graph 
obtained from D{Q) by removing vertex /. Since Q is outerplanar, Tq is a tree (in particular 
Tq is simple). A face of Q that is a leaf of Tq will be called a leaf-face. 

CASE 2: Block Q contains a leaf-face of odd length / = viV2 ■ ■ - Vi such that for i = 2, . . . — l 
we have deg fj{vi) = 2. 

We apply induction to H — V{f) and we get relevant Jq, Cq such that 

9|/o| > 4n-4£-3|eo|. (8) 

We set I = IqU {^2,^4, • • • ,^^-1} and C = Co U {/}. Note that since Q is not a single edge, 
degQ(f) > 2 for every vertex w £ Q, so if degjj{v) = 2 then v is not a cut vertex in H. In 
particular none of the vertices V2,V4, . . . , V£-i is a cutvertex. It follows that / is an independent 
set and we have 

9|/| = 9 (^\Io\ + >4n- 3[eoI + - 1) - 4£ = 4n - 3|e| + > 4n - 3|e|. 

CASE 3: Block Q contains a leaf-face of even length / = V1V2 ■ ■ - Vi such that for i = 2, . . . — l 
we have degfj{vi) = 2 and degfj{vi) = degQ{vi) = 3. 

Let w be the neighbor of distinct from vi, . . . , Vi-i. We apply induction to H—{V{f)U{w}) 
and we get relevant Iq, Qq such that 

9|Io| > 4n-4£-4-3|eo|. (9) 
We set I = IqU {^2,^4, • • • , Vi} and C = Cq U {/}. Clearly, / is an independent set and we have 

p\ 9 £ — 2 

9\I\ = 9 ( Uol + 2 ) > 4n - 3|eo| + - 4£ - 4 = 4n - 3|e| + > 4n - 3|e|. 
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CASE 4: Block Q contains two leaf-faces of even length with a common vertex, fi = viV2 ■ ■ - vi 
and /2 = W1W2 ■ ■ - Wk with = w^, such that for i = 2, ...,£ — 1 we have degij(vi) = 2, for 
i = 2, . . . , k — 1 we have degjj{vi) = 2 and degff{vi) = degQ(f^) = 4. 

We apply induction to H — {V (fi) U V {f2)) and we get relevant Iq, Cq such that 

9|/o| > 4n-4(A; + £- 1) -3|eo|. (10) 

We set / = IqLI{v2, V4, . . . , ve}U{w2, W4, ... , Wk} and C = CoU{/i}. Clearly, I is an independent 
set and we have 

l^ol + ^ + f -Ij >4n-3|eo| + ^-5 = 4n-3|e| + ^^^ ^ >4n-3|e|. 

CASE 5: none of Cases 1-4 applies. 

We show that Q has at most two bounded faces. 

First, assume that Tq has at least two non-leaves. Then, there is a non-leaf /* such that 

• /* has exactly 1 non-leaf neighbor /** in Tq, and 

• for every leaf neighbor / of /*, if / is incident with a cutvertex x in H, then x is incident 
also with both /* and /** . 

Note that by Case 2 all leaf neighbors of /* are of even length. Moreover, by Case 3, /* has 
|/*| — 1 > 2 leaf neighbors. Hence, Case 4 applies, a contradiction. 

Now, assume Tq has exactly one non-leaf /*. Then, /* has at least 2 leaf neighbors in 
Tq. By Case 2, at least one of them, call it /, is of even length (note that /* may have a leaf 
neighbor /' of odd length which contains a vertex v such that v is not incident with /* and v is 
a cutvertex of H: then deg^(u) > 2 and Case 2 would not apply to /', but if there is another 
odd leaf neighbor of /* then Case 2 applies to it). Since Case 3 does not apply to /, so (a) / is 
incident with another two leaf neighbors /' and /" of /* or (b) / is incident with exactly one 
leaf neighbor /' and / has a cutvertex x oi H such that x is not incident with /'. In situation 
(a), by Case 2 and since Q has at most one cutvertex of H, at least one of these leaf neighbors 
is of even length. But then Case 4 applies, a contradiction. In situation (b), /' is of even length 
by Case 2 and hence Case 4 applies to / and /', a contradiction. 

It follows that Tq has only leaves, i.e. Tq is a single leaf or two adjacent leaves (in other 
words Q has at most two bounded faces). 
CASE 5a: |F(rQ)| = 1, i.e. Q is a single facial cycle C. 

Note that |C| is even for otherwise Case 2 applies. Then we apply induction to H — V{C) 
and we get relevant lo, Co such that 

9|/o| >4n-4|C| -3|eo|. (11) 

We set C = Co and we set / to be the independent set obtained from Iq by adding every second 
vertex of C : this can be done in two ways and we choose a way that does not add a cutvertex 
ofH. 

9\I\ = 9 ^|/o| + >4n- 4|C| - 3|C| + ^|C| = 4n - 3|C| + > 4n - 3|C|. 

CASE 5b: |F(Tq)| = 2, i.e. Q has two bounded faces /i, /2 and they share an edge. 

Since Case 2 does not apply, at least one face, say /i is of even length. If both /i and /2 
are even then we apply induction to H — V{Q) getting relevant Iq and Co and we create / by 
extending Iq by every second vertex of the outer face of Q so that a cutvertex of H, if any, is 
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not added. By the same calculation as in Case 5a we get the desired inequality. Hence we can 
assume that /i is even and /2 is odd, but we will show that this last case cannot happen. Then 
a cutvertex of H belongs to V{fi), for otherwise Case 3 applies. But then Case 2 applies, a 
contradiction. This concludes the proof of Theorem [71 □ 

4 A simple 5k kernel for (Planar) Max Leaf 

In this section we show a simple kernelization algorithm for Max Leaf and Planar Max 
Leaf. Below we describe three simple rules, which preserve planarity. 

• (1, 2)-rule If there is a 1-vertex u adjacent with a 2-vertex v then remove v. 

• Adjacent 2- vertices Rule Assume that there are two adjacent 2-vertices u and v. If uv 
is a bridge, contract uv, otherwise remove uv. 

• Trivial Rule If G consists of a single edge, return YES if A; < 2. 

It is quite clear that the above rules are correct for (Planar) Max Leaf (see e.g. [LQ] , 
Rules 1-3 for a proof). Note that if none of our rules applies to a connected graph G, then every 
edge of G has an endpoint of degree at least 3. 

Theorem 14. Let G be a connected graph in which every edge has an endpoint of degree at 
least 3. Then G has a spanning tree with at least n/5 leaves. 

Proof. We proceed similarly as in the proof of Theorem [3l As before, we start from a single 
vertex r and we build a tree T by applying operations 01-03 as long as possible. Then we 
perform a new operation 04, which expands any leaf w of T which is not dead. Hence if T is 
not spanning, one of operations 01-04 is applicable. The algorithm builds the tree by applying 
01-04, but 04 is performed only if none of 01-03 applies. Let T be the final spanning tree of 
G. 

Now we show a lower bound on |L(T)|. Using the notation from Section [2.21 we can state 
an analog of ([T]): 

|X2| + |X3nPi| + |X4| + ^(ijPd| = n-l. (12) 

d>2 

Consider an operation 04, which expands a leaf v of T. As we have shown in Section 12. 2[ 
\Ng{v)\V{T)\ = {x} for some x of degree 2 in G and moreover the neighbor z of x distinct from 
V is outside T. Since every edge has an endpoint of degree at least 3, deg(z) > 3. It follows that 
just after expanding v by 04, x is expanded by operation 03. Hence {X^l < jXsnPil. Moreover, 
in Section[2i2]we have shown that IX3 nPi| = IX3 nP>2l < \L{T)\ - 1 and \X2\ < \L{T)\. After 
plugging these three bounds to ([T2]) we get |i^(r)| > n/5, as required. □ 

Theorem [131 is another variation of the Kleitman-West result which might be of independent 
interest. Note that the bound of Theoremll4lis tight up to an additive constant, which is shown 
by the following family of graphs G^. Begin with a cycle vqVi . . . U4fc_ifo- Then add k vertices 
Wo, . . . , Wk-i. Finally, for each i = 0, . . . , k — 1 add the following edges: WiV^i+i, WiV4i+2, WiVa+s. 
It is easy to see that every edge of Gk has an endpoint of degree at least 3. It is also easy 
to see that if is a spanning tree of Gk with maximum possible number of leaves, then 
lim\L{n)\/\V{Gk)\ = 5. 

Now we can describe our kernelization algorithm. Let G' be the graph obtained from G by 
applying our three rules as long as one of them applies. By Theorem [TH if /c < n/5 we can 
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return the answer YES. Hence n < 5k and G' is a 5A;-kernel for Planar Max Leaf and Max 
Leaf. 
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